Sequential Labeling with online Deep Learning

نویسندگان

  • Gang Chen
  • Sargur N. Srihari
چکیده

In this paper, we leverage both deep learning and conditional random fields (CRFs) for sequential labeling. More specifically, we propose a mixture objective function to predict labels either independent or correlated in the sequential patterns. We learn model parameters in a simple but effective way. In particular, we pretrain the deep structure with greedy layer-wise restricted Boltzmann machines (RBMs), followed with an independent label learning step. Finally, we update the whole model with an online learning algorithm, a mixture of perceptron training and stochastic gradient descent to estimate parameters. We test our model on different challenge tasks, and show that this simple learning algorithm yields the state of the art results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Word Embedding and Topic Modeling Enhanced Multiple Features for Content Linking and Argument / Sentiment Labeling in Online Forums

Multiple grammatical and semantic features are adopted in content linking and argument/sentiment labeling for online forums in this paper. There are mainly two different methods for content linking. First, we utilize the deep feature obtained from Word Embedding Model in deep learning and compute sentence similarity. Second, we use multiple traditional features to locate candidate linking sente...

متن کامل

Learning in the Deep-Structured Conditional Random Fields

We have proposed the deep-structured conditional random fields (CRFs) for sequential labeling and classification recently. The core of this model is its deep structure and its discriminative nature. This paper outlines the learning strategies and algorithms we have developed for the deep-structured CRFs, with a focus on the new strategy that combines the layer-wise unsupervised pre-training usi...

متن کامل

Online End-of-Turn Detection from Speech Based on Stacked Time-Asynchronous Sequential Networks

This paper presents a novel modeling called stacked timeasynchronous sequential networks (STASNs) for online endof-turn detection. An online end-of-turn detection that determines turn-taking points in a real-time manner is an essential component for human-computer interaction systems. In this study, we use long-range sequential information of multiple time-asynchronous sequential features, such...

متن کامل

Backpropagation in Sequential Deep Neural Networks

Most previous work applying neural networks to problems in speech processing has combined the output of a static network trained over a sliding window of input with an HMM or CRF to model linear-chain dependencies in the output. The recently developed Sequential Deep Neural Network (SDNN) model allows sequential dependencies between internal hidden units, allowing them to potentially detect lon...

متن کامل

A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. This leads to poor performance in theory and often in practice. Some recent approaches (Daumé III et al., 2009; Ross and Bagnell, 2010) provide stronger guarantees in this setting, but remain somewhat u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1412.3397  شماره 

صفحات  -

تاریخ انتشار 2014